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DESCRIPTION 



SPECTRUM CODING APPARATUS , SPECTRUM DECODING APPARATUS, 
ACOUSTIC SIGNAL TRANSMISSION APPARATUS, 
5 ACOUSTIC SIGNAL RECEPTION APPARATUS AND METHODS THEREOF 



Technical Fields 

The present invention relates to a method of 
extending a frequency band of an audio signal or voice 
10 signal and improving sound quality, and further to a coding 
method and decoding method of an audio signal or voice 
signal applying this method. 

Background Art 

15 A voice coding technique and audio coding technique 

which compresses a voice signal or audio signal at a low 
bit rate are important for the effective utilization of 
a transmission path capacity of radio wave or the like 
in a mobile communication and a recording medium. 

20 Voice coding for coding a voice signal includes 

schemes such as G726 and G729 standardized in the ITU-T 
(International Telecommunication Union 

Telecommunication Standardization Sector) . These 
schemes target narrow band signals (300 Hz to 3.4 kHz) 

25 and can perform high quality coding at 8 kbits/s to 32 
kbits/s. However, because such a narrow band signal has 
a frequency band as narrow as a maximum of 3.4 kHz, and 



as for quality, sound is muffled and lacks a sense of 
realism . 

On the other hand, in the field of voice coding, 
there is a scheme which targets a wideband signal (50 
5 Hz to 7 kHz) for coding. Typical example s of such a me thod 
include G722, G722.1 of the ITU-T and AMR- WB of the 3GPP 
(The 3rdGeneration Partnership Project) and so on. These 
schemes can perform coding on a wideband voice signal 
at abit rate of 6. 6 kbits/s to 64 kbits/s. When the signal 

10 to be coded is a voice, a wideband signal has relatively 
high quality, but it is not sufficient when an audio signal 
is the target or when a quality with a high sense of realism 
is required for the voice signal. 

Generally, when a maximum frequency of a signal is 

15 approximately 10 to 15 kHz, a sense of realism equivalent 
to that of FM radio is obtained and quality comparable 
to that of a CD is obtained if the frequency is on the 
order of 20 kHz. Audio coding represented by the layer 
3 scheme and the AAC scheme standardized in MPEG (Moving 

20 Picture Expert Group) and so on is suitable for such a 
signal. However, in case of these audio coding schemes, 
the bit rate increases because the frequency band to be 
coded is widened. 

The National Publication of International Patent 

25 Application No . 2 0 0 1 - 5 2 1 6 4 8 describes a technique of 
reducing an overall bit rate by dividing an input signal 
into a low- frequency band and a high-frequency band and 
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substituting the high-frequency band by a low-frequency 
band spectrum as the method of coding a wideband signal 
at a low bit rate and with high quality. The state of 
processing when this conventional technique is applied 
5 to an original signal will be explained using FIGs . 1A 
to D. Here, a case where a conventional technique is 
applied to an original signal will be explained to 
facilitate explanations . In FIGs . 1A to D, the horizontal 
axis shows a frequency and the vertical axis shows a 

10 logarithmic power spectrum. Furthermore, FIG.1A shows 
a logarithmic power spectrum of the original signal when 
a frequency band is limited to 0^k<FH, FIG. IB shows a 
logarithmic power spectrum when the band of the same signal 
is limited to 0^k<FL ( FL< FH ) , FIG.1C shows a case where 

15 a spectrum in a high-frequency band is substituted by 
a spectrum in a low- frequency band using the conventional 
technique and FIG. ID shows a case where the substituted 
spectrum is reshaped according to spectral outline 
information. According to the conventional technique, 

20 the spectrum of the original signal (FIG.1A) is expressed 
based on a signal having a spectrum of 0^k<FL (FIG. IB), 
and therefore the spectrum of the high-frequency band 
(FL^K<FH in this figure) is substituted by the spectrum 
of the low- frequency band (0 ^ k< FL ) (FIG.1C) . For 

25 simplicity, a case assuming that there is a relationship 
of FL= FH / 2 is explained. Next, the amplitude value of 
the substituted spectrum in the high-frequency band is 
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adjusted according to the spectrum envelope information 
of the original signal and a spectrum obtained by 
estimating the spectrum of the original signal is 
determined (FIG. ID) . 

5 

Disclosure of Invention 

Generally, the spectrum of a voice signal or an audio 
signal is known to have a harmonic structure in which 
a spectral peak appears at an integer mult iple of a certain 

10 frequency as shown in FIG.2A. The harmonic structure is 
important information in maintaining quality and when 
a gap occurs in the harmonic structure, a quality 
degradation is perceived. FIG. 2 A shows a spectrum when 
the spectrum of some audio signal is analyzed. As seen 

15 in this figure, a harmonic structure with interval T is 
observed in the original signal. Here, a diagram showing 
that the spectrum of the original signal is estimated 
according to the conventional technique is shown in FIG.2B 
When these two figures are compared, it is observed that 

20 while the harmonic structure is maintained in the 
low- frequency band spectrum in the substitution source 
(area Al) and the high-frequency band spectrum (area A2 ) 
in the substitution destination in FIG.2B, the harmonic 
structure collapses in the connection section (area A3) 

25 of the low- frequency band spectrum of the substitution 
source and the high-frequency band spectrum in the 
substitution destination. This is attributable to the 
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fact that the conventional technique performs 
substitution without considering the shape of the 
harmonic structure. The subjective quality deteriorates 
due to such disturbance of the harmonic structure when 
5 an estimated spectrum is converted to a time signal and 
listened . 

Furthermore, when FL is smaller than FH/2, that is, 
when it is necessary to substitute the low- frequency band 
spectrum twice or more in the band of FL ^ k< FH , another 

10 problem occurs in adjustment of the spectral outline. 
The problem will be explained using FIG.3A and FIG.3B. 
The spectrum of a voice signal or audio signal is generally 
not flat and the energy of either the low- frequency band 
or the high-frequency band is larger. In this way, there 

15 is an tilt in the spectrum of a voice signal or audio 
signal and the energy of the high-frequency band is often 
smaller than the energy of the low- frequency band. When 
substitution of the spectrum is performed in such a 
situation, discontinuity of the spectral energy occurs 

20 (FIG.3A). As shown in FIG.3A, when a spectral outline 
is adjusted every predetermined period (subband), the 
discontinuity of the energy is not canceled (area A4 and 
area A5 in FIG.3B) , annoying sound occurs in the decoded 
signal because of this phenomenon and subjective quality 

25 deteriorates . 

In view of the above described problems, the present 
invention proposes a technique of coding a signal of a 
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wide frequency band at a lowbit rate and with high quality . 

The present invention provides a spectrum coding 
method of estimating the shape of the spectrum of the 
high-frequency band using a filter having the 
5 low- frequency band as the internal state and coding the 
coefficient representing the characteristic of the filter 
at that time to adjust a spectral outline of the estimated 
high-frequency band spectrum. This makes it possible to 
improve quality of a decoded signal. 

10 

Brief Description of Drawings 

FIG.1A shows a conventional bit rate compression 
technique ; 

FIG. IB shows a conventional bit rate compression 
15 technique; 

FIG.1C shows a conventional bit rate compression 
technique ; 

FIG. ID shows a conventional bit rate compression 
technique; 

20 FIG.2A shows a harmonic structure of a spectrum of 

a voice signal or audio signal; 

FIG.2B shows a harmonic structure of a spectrum of 
a voice signal or audio signal; 

FIG.3A shows discontinuity of energy produced when 
25 adjusting the spectral outline; 

FIG.3B shows discontinuity of energy produced when 
adjusting the spectral outline; 
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FIG. 4 illustrates a block diagram showing the 
configuration of a spectrum coding apparatus according 
to Embodiment 1; 

FIG. 5 illustrates a process of calculating an 
estimated value of a second spectrum through filtering; 

FIG. 6 illustrates a processing flow at the filtering 
section, search section and pitch coefficient setting 
section ; 

FIG.7A shows an example of the state of filtering; 

FIG.7B shows an example of the state of filtering; 

FIG.7C shows an example of the state of filtering; 

FIG.7D shows an example of the state of filtering; 

FIG.7E shows an example of the state of filtering; 

FIG.8A shows another example of the harmonic 
structure of a first spectrum stored in the internal state; 

FIG.8B shows a further example of the harmonic 
structure of the first spectrum stored in the internal 
state; 

FIG.8C shows a still further example of the harmonic 
structure of the first spectrum stored in the internal 
state; 

FIG.8D shows a still further example of the harmonic 
structure of the first spectrum stored in the internal 
state; 

FIG.8E shows a still further example of the harmonic 
structure of the first spectrum stored in the internal 
state; 
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FIG. 9 is a block diagram showing the configuration 
of a spectrum coding apparatus according to Embodiment 
2; 

FIG. 10 illustrates a state of filtering according 
5 to Embodiment 2; 

FIG. 11 is a block diagram showing the configuration 
of a spectrum coding apparatus according to Embodiment 
3; 

FIG. 12 illustrates a state of processing of 
10 Embodiment 3; 

FIG. 13 is a block diagram showing the configuration 
of a spectrum coding apparatus according to Embodiment 
4; 

FIG. 14 is a block diagram showing the configuration 
15 of a spectrum coding apparatus according to Embodiment 
5; 

FIG. 15 is a block diagram showing the configuration 
of a spectrum coding apparatus according to Embodiment 
6; 

20 FIG. 16 is a block diagram showing the configuration 

of a spectrum coding apparatus according to Embodiment 
7; 

FIG. 17 is a block diagram showing the configuration 
of a hierarchic coding apparatus according to Embodiment 
25 7 ; 

FIG. 18 is a block diagram showing the configuration 
of a hierarchic coding apparatus according to Embodiment 

8 
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FIG. 19 is a block diagram showing the configuration 
of a spectrum decoding apparatus according to Embodiment 
9; 

5 FIG. 20 illustrates the state of a decoded spectrum 

generated from the filtering section according to 
Embodiment 9; 

FIG. 21 is a block diagram showing the configuration 
of a spectrum decoding apparatus according to Embodiment 
10 10; 

FIG. 22 is a flow chart of Embodiment 10; 

FIG. 23 is a block diagram showing the configuration 
of a spectrum decoding apparatus according to Embodiment 
11; 

15 FIG. 24 is a block diagram showing the configuration 

of a spectrum decoding apparatus according to Embodiment 
12; 

FIG. 25 is a block diagram showing the configuration 
of a hierarchic decoding apparatus according to 
20 Embodiment 13; 

FIG. 26 is a block diagram showing the configuration 
of the hierarchic decoding apparatus according to 
Embodiment 13; 

FIG. 27 is a block diagram showing the configuration 
25 of an acoustic signal coding apparatus according to 
Embodiment 14; 

FIG. 28 is a block diagram showing the configuration 
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of an acoustic signal decoding apparatus according to 
Embodiment 15; 

FIG. 29 is a block diagram showing the configuration 
of an acoustic signal transmission coding apparatus 
according to Embodiment 16; and 

FIG. 30 is a block diagram showing the configuration 
of an acoustic signal reception decoding apparatus 
according to Embodiment 17 of the present invention. 

Best Mode for Carrying out the Invention 

With reference now to the accompanying drawings, 
embodiments of the present invention will be explained 
in detail below. 

(Embodiment 1) 

FIG. 4 is a block diagram showing the configuration 
of spectrum coding apparatus 100 according to Embodiment 
1 of the present invention. 

A first signal whose effective frequency band is 
0^ k< FL is input from input terminal 102 and a second signal 
whose effective frequency band is 0^k<FH is input from 
input terminal 103. Next, frequency domain 

transformation section 104 performs a frequency 
transformation on the first signal input from input 
terminal 102, calculates first spectrum Sl(k) and 
frequency domain transformation section 105 performs a 
frequency transformation on the second signal input from 
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input terminal 103 and calculates second spectrum S2 (k) . 
Here, discrete Fourier transform ( D FT ) , discrete cosine 
transform (DCT), modified discrete cosine transform 
(MDCT) or the like can be applied as the frequency 
5 transformation method. 

Next, internal state setting section 106 sets an 
internal state of a filter used in filtering section 107 
using first spectrum Sl(k) . Filtering section 107 
performs filtering based on the internal state of the 

10 filter set by internal state setting section 106 and pitch 
coefficient T given from pitch coefficient setting 
section 109 and calculates estimated value D2(k) of the 
second spectrum. The process of calculating estimated 
value D2 (k) of the second spectrum through filtering will 

15 be explained using FIG. 5. In FIG . 5 , suppose the spectrum 
of 0^k<FH is called xx S(k)" for convenience. As shown 
in FIG. 5, first spectrum Sl(k) is stored in the area of 
0^Sk<FL in S(k) as the internal state of the filter and 
estimated value D2 (k) of the second spectrum is generated 

20 in the area of FL^k<FH. 

This embodiment will explain a case where a filter 
expressed by the following Expression (1) is used and 
T here denotes the coefficient given from coefficient 
setting section 109. Furthermore, suppose M=l in this 

25 explanation. 

n^—^ ... (i) 

i=-M 
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In the filtering processing, an estimated value is 
calculated by multiplying each frequency by corresponding 
coefficient Pi centered on a spectrum which is lower by 
frequency T in ascending order of frequency and adding 
up the multiplication results. 

S(k) = jr/3 rS (k-T-i)...(2) 

Processing according to Expression (2) is performed 
between FL^k<FH. S(k) (FL^k<FH) calculated as a result 
is used as estimated value D2(k) of the second spectrum. 

Search section 108 calculates a degree of similarity 
between second spectrum S2 ( k) given from frequency domain 
transformation section 105 and estimated value D2(k) of 
the second spectrum given from filtering section 107. 
There are various definitions of the degree of similarity 
and this embodiment will explain a case where filter 
coe f f icients p_i and Pi are assumed to be 0 and the degree 
of similarity calculated according to the following 
Expression (3) defined based on a minimum square error 
is used. In this method, filter coefficient Pi is 
determined after calculating optimum pitch coefficient 
T . 

f£W)^2(*)Y 
E = ZS2{kf L mm ( 3 ) 

k-FL 

Here, E denotes a square error between S2(k) and 
D2(k). Because the first term on the right side of 
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Expression (3) is a fixed value regardless of pitch 
coefficient T, pitch coefficient T which generates D2 (k) 
corresponding to a maximum of the second term on the 
right side of Expression (3) is searched. In this 
5 embodiment, the second term on the right side of 
Expression (3) will be referred to as a "degree of 
s imilar i ty . " 

Pitch coefficient setting section 109 has the 
function of outputting pitch coefficient T included in 

10 a predetermined search range TMIN to TMAX to filtering 
section 107 sequentially. Therefore, every time pitch 
coefficient T is given from pitch coefficient setting 
section 109, filtering section 107 clears S(k) in the 
range of FL^k<FH to zero and then performs filtering and 

15 search section 108 calculates a degree of similarity. 
Search section 108 determines pitch coefficient Tmax 
corresponding to a maximum degree of similarity 
calculated between TMIN and TMAX and gives pitch 
coefficient Tmax to filter coefficient calculation 

20 section 110, second spectrum estimated value generation 
section 115, spectral outline adjustment subband 
determining section 112 and multiplexing section 111. 
FIG. 6 shows the processing flow of filtering section 107, 
search section 108 and pitch coefficient setting section 

25 109. 

FIGs.7A to E show an example of filtering state for 
ease in understanding of this embodiment. FIG.7A shows 



the harmonic structure of the first spectrum stored in 
the internal state. FIGs.7B to D show the relationship 
between the harmonic structures of the estimated values 
of the second spectrum calculated by performing filtering 
using three types of pitch coefficients To, Ti, T 2 . 
According to this example, Ti whose shape is similar to 
second spectrum S2(k) is selected as pitch coefficient 
T whereby the harmonic structure is maintained (see FIG.7C 
and FIG . 7E) . 

Furthermore, FIGs . 8A to E show another example of 
the harmonic structure of the first spectrum stored in 
the internal state. In this example also, an estimated 
spectrum whereby the harmonic structure is maintained 
is calculated when pitch coefficient Ti is used and it 
is Ti that is output from search section 108 (see FIG.8C 
and FIG . 8E) . 

Next, filter coefficient calculation section 110 
determines filter coefficient Pi using pitch coefficient 
Tmax given from search section 108. Filter coefficient 
Pi is determined so as to minimize square distortion E 
which follows the following Expression (4) . 



Filter coefficient calculation section 110 stores 
a plurality of combinations of Pi (i=-l,0,l) as a table 
beforehand, determines a combination of Pi (i=-l,0,l) 
which minimizes square error E of Expression (4) and gives 
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the code to second spectrum estimated value generation 
section 115 and multiplexing section 111. 

Second spectrum estimated value generation section 
115 generates estimated value D2 (k) of the second spectrum 
5 according to Expression (1) using pitch coefficient Tmax 
and filter coefficient Pi and gives it to spectral outline 
adjustment coefficient coding section 113. 

Pitch coefficient Tmax is also given to spectral 
outline adjustment subband determining section 112. 
10 Spectral outline adjustment subband determining section 

112 determines a subband for spectral outline adjustment 
based on pitch coefficient Tmax. A j th subband can be 
expressed by the following Expression (5) using pitch 
coefficient Tmax. 

[ BL(j) = FL + (j-\\T . x 
15 1 (0<j<j)... (5) 

Here, BL(j) denotes a minimum frequency of the jth 
subband and BH ( j ) denotes a maximum frequency of the jth 
subband. Furthermore, the number of subbands J is 
expressed as a minimum integer corresponding to maximum 
20 frequency BH(J-l) of the (j-l)th subband that exceeds 
FH . The information about the spectral outline 

adjustment subband determined in this way is given to 
spectral outline adjustment coefficient coding section 

113 . 

25 Spectral outline adjustment coefficient coding 

section 113 calculates a spectral outline adjustment 
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coefficient and performs coding using the spectral 
outline adjustment subband information given from 
spectral outline adjustment subband determining section 
112, estimated value D2(k) of the second spectrum given 
5 from second spectrum estimated value generation section 
115 and second spectrum S2 (k) given from frequency domain 
transformation section 105. This embodiment will 
explain a case where the relevant spectrum outline 
information is expressed with spectral power for each 
10 subband. At this time, the spectral power of the jth 
subband is expressed by the following Expression (6) . 

BHU) 

BU)= E S2(kf... (6) 

k=BL(j) 

Here, BL(j) denotes a minimum frequency of the jth 
subband and BH(j) denotes a maximum frequency of the jth 

15 subband. The subband information of the second spectrum 
determined in this way is regarded as the spectral outline 
information of the second spectrum. Likewise, subband 
information b(j) of estimated value D2(k) of the second 
spectrum is calculated according to the following 

20 Expression (7), 

60)= B ^D2{kJ...{D 

k=BL{j) 

and amount of variation V (j ) is calculated for each subband 
according to the following Expression (8) . 
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Next, amount of variation V(j) is coded and the code 
is sent to multiplexing section 111. 

To calculate more detailed spectral outline 
information, the following method may also be applied. 
A spectral outline adjustment subband is further divided 
into subbands of a smaller bandwidth and a spectral outline 
adjustment coefficient is calculated for each subband. 
For example, when the jth subband is divided by division 
number N, 



10 VU ' n) = ^^ {Q<j<J 9 0<n<N)...{9) 

a vector of the Nth order spectrum adjustment coefficient 
is calculated for each subband using Expression (9) , this 
vector is vector-quantized and an index of a 
representative vector corresponding to minimum 
15 distortion is output to multiplexing section 111. Here, 
B(j,n) and b(j,n) are calculated as follows: 

BH{j,n) 

B(j,n) = ]T S2 ( k Y (0<j<J,0<n< N)... (10) 

k=BL(j,n) 
BHU,") 

bU,n)= £ D2(kf (0<j<J 9 0<n<N)...(H) 

k=BL(j,n) 

Furthermore, BL(j,n), BH(j,n) denote a minimum 
20 frequency and a maximum frequency of the nth division 
section of the jth subband respectively. 

Multiplexing section 111 multiplexes information 
about optimum pitch coefficient Tmax obtained from search 
section 108, information about the filter coefficient 
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obtained from filter coefficient calculation section 110 
and information about the spectral outline adjustment 
coefficient obtained from spectral outline adjustment 
coefficient coding section 113 and outputs the 
5 multiplexing result from* output terminal 114. 

This embodiment has explained when M=l in Expression 
(1), but M is not limited to this value and any integer 
equal to or more than 0 can be used. Furthermore, this 
embodiment has explained the case where frequency domain 
10 transformation sections 104,105 are used, but these are 
the components which are necessary when a time domain 
signal is input and the frequency domain transformation 
section is not necessary in a configuration in which a 
spectrum is input directly. 

15 

(Embodiment 2) 

FIG. 9 is a block diagram showing the configuration 
of spectrum coding apparatus 200 according to Embodiment 
2 of the present invention. Since this embodiment adopts 

20 a simple configuration for a filter used at a filtering 
section, it requires no filter coefficient calculation 
section and produces the effect that a second spectrum 
can be estimated with a small amount of calculation. In 
FIG. 9, components having the same names as those in FIG. 4 

25 have identical functions, and therefore detailed 
explanations of such components will be omitted. For 
example, spectral outline adjustment subband determining 
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section 112 in FIG. 4 has a name "spectral outline 
adjustment subband determining section" identical to the 
spectral outline adjustment subband determining section 
209 in FIG. 9, and therefore it has an identical function. 

The configuration of the filter used at filtering 
section 206 is a simplified one as shown in the following 
expression . 

P(z) = — ^...(12) 
1 -z 

Expression (12) corresponds to a filter expressed 
assuming M=0, (3o=l based on Expression (1) . The state 
of filtering in this case is shown in FIG. 10. In this 
way, estimated value D2(k) of the second spectrum can 
be obtained by sequentially copying spectra in the 
low- frequency band located apart by T. 

Furthermore, search section 207 determines optimum 
pitch coefficient Tmax by searching pitch coefficient 
T which corresponds to a minimum value in Expression (3) 
as in the case of Embodiment 1. Pitch coefficient Tmax 
obtained in this way is given to multiplexing section 
211 . 

This configuration assumes that a value temporarily 
generated by search section 207 for the search is used 
as estimated value D2(k) of the second spectrum given 
to spectral outline adjustment coefficient coding section 
210. Therefore, second spectrum estimated value D2(k) 
is given to spectral outline adjustment coefficient 
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coding section 210 from search section 207. 

(Embodiment 3) 

FIG. 11 is a block diagram showing the configuration 
5 of spectrum coding apparatus 300 according to Embodiment 
3 of the present invention. The features of this 
embodiment include dividing a band FL^k<FH is into a 
plurality of subbands beforehand, performing a search 
for pitch coefficient T , calculation of a filter 

10 coefficient and adjustment of a spectral outline for each 
subband and coding these pieces of information. 

This avoids the problem with discontinuity of 
spectral energy caused by a spectral tilt included in 
the spectrum in a band of 0^ k< FL which is the substitution 

15 source. In addition, coding is performed independently 
for each subband, and therefore it is possible to produce 
the effect of realizing an extension of a band of higher 
quality. Because the components in FIG. 11 having the same 
names as those in FIG. 4 have identical functions, detailed 

20 explanations of such components will be omitted. 

Subband division section 309 divides band FL^k<FH 
of second spectrum S2(k) given from frequency domain 
transformation section 304 into predetermined J subbands . 
This embodiment will be explained assuming J=4 . Subband 

25 division section 309 outputs spectrum S2(k) included in 
a 0th subband to terminal 310a. In the same way, spectra 
S2(k) included in a first subband, second subband and 

20 



third subband are output to terminals 310b, 310c and 310d 
respectively . 

Subband selection section 312 controls switching 
section 311 in such a way that the switching section 311 
5 selects terminal 310a, terminal 310b, terminal 310c and 
terminal 310d sequentially. In other words, subband 
selection section 312 sequentially selects the 0th 
subband, first subband, second subband and third subband 
and gives spectrum S2(k) to search section 307, filter 

10 coefficient calculation section 313 and spectral outline 
adjustment coefficient coding section 314 . Hereinafter, 
processing is performed in subband units, pitch 
coefficient Tmax, filter coefficient Pi and spectral 
outline adjustment coefficient are calculated for each 

15 subband and given to multiplexing section 315. Therefore, 
information about J pitch coefficients Tmax, information 
about J filter coefficients and information about J 
spectral outline adjustment coefficients are given to 
multiplexing section 315. 

20 Furthermore, since subbands are predetermined in 

this embodiment, the spectral outline adjustment subband 
determining section is not necessary. 

FIG. 12 illustrates the state of processing according 
to this embodiment. As shown in this figure, band FL 

25 ^ k<FH is divided into predetermined subbands, Tmax, (3i, 
Vq are calculated for each subband and sent to the 
multiplexing section respectively. This configuration 
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matches the bandwidth of a spectrum substituted from a 
low-frequency band spectrum with the bandwidth of the 
subband for spectral outline adjustment, which results 
in preventing discontinuity of spectral energy and 
5 improving sound quality. 

(Embodiment 4) 

FIG. 13 is a block diagram showing the configuration 
of spectrum coding apparatus 400 according to Embodiment 

10 4 of the present invention. A feature of this embodiment 
includes simplifying the configuration of a filter used 
at a filtering section based on above described Embodiment 
3 . This eliminates the necessity for a filter coefficient 
calculation section and has the effect that a second 

15 spectrum can be estimated with a smaller amount of 
calculation. In FIG. 13, components having the same names 
as those in FIG. 11 have identical functions, and therefore 
detailed explanations of such components will be omitted. 

The configuration of the filter used at filtering 

20 section 406 is simplified as shown in the following 
expression . 

/>(*) = — ^...(13) 
1 — z 

Expression (13) corresponds to a filter which is 
expressed based on Expression (1) assuming M=0, Po=l- The 
25 state of filtering at this time is shown in FIG. 10. In 
this way, estimated value D2(k) of the second spectrum 
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can be determined by sequentially copying spectra in the 
low- frequency band located apart by T. Furthermore, 
search section 407 searches for pitch coefficient T which 
corresponds to a minimum value in Expression (3) and 
5 determines it as optimum pitch coefficient Tmax as in 
the case of Embodiment 1 . Pitch coefficient Tmax obtained 
in this way is given to multiplexing section 414. 

This configuration assumes that a value temporarily 
generated for a search by search section 407 is used as 
10 estimated value D2(k) of the second spectrum given to 
spectral outline adjustment coefficient coding section 
413. Therefore, second spectrum estimated value D2(k) 
is given to spectral outline adjustment coefficient 
coding section 413 from search section 407. 

15 

(Embodiment 5) 

FIG. 14 is a block diagram showing the configuration 
of spectrum coding apparatus 500 according to Embodiment 
5 of the present invention. Features of this embodiment 

20 include correcting spectral tilts of first spectrumSl (k) 
and second spectrum S2(k) using an LPC spectrum 
respectively, and determining estimated value D2(k) of 
the second spectrum using the corrected spectra. This 
produces the effect of solving the problem of 

25 discontinuity of spectral energy . In FIG. 14, components 
having the same names as those in FIG. 13 have identical 
functions, and therefore detailed explanations of such 
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components will be omitted. Moreover, this embodiment 
will explain a case where a technique of correcting 
spectral tilts is applied to above described Embodiment 
4, but this technique is not limited to this and is also 
applicable to each of above described Embodiments 1 to 
3 . 

Here, LPC coefficients calculated by an LPC analysis 
section (not shown here) or LPC decoding section is input 
from input terminal 505 and given to LPC spectrum 
calculation section 506. Apart from this, the 

configuration may also be adapted such that the LPC 
coefficients is determined by performing an LPC analysis 
on the signal input from input terminal 501 . In this case, 
input terminal 505 is not necessary and the LPC analysis 
section is newly added instead. 

LPC spectrum calculation section 506 calculates a 
spectrum envelope according to Expression (14) shown 
below based on the LPC coefficients. 



el(*) = 



1 



NP 



.2nki 



1 -]£>(/)• <f 



••(14) 



Or the spectrum envelope may also be calculated 
according to the following Expression (15) . 



1 



i-5>(oyv 



2*fa 



...(15) 



Here, a denotes LPC coefficients, NP denotes the 
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order of the LPC coefficients and K denotes a spectral 
resolution . 

Furthermore, y is a constant equal to or greater 
than 0 and less than 1 and the use of this y can smooth 
5 the shape of the spectrum. 

Spectrum envelope el(k) obtained in this way is given 
to spectral tilt correction section 507. 

Spectral tilt correction section 507 corrects 
spectral tilt which is present in first spectrum Sl(k) 
10 given from frequency domain transformation section 503 
using spectrum envelope el (k) obtained from LPC spectrum 
calculation section 506 according to the following 
Expre s s ion (16). 

SW(*) = £H& (16) 
el(k) 

15 The corrected first spectrum obtained in this way 

is given to internal state setting section 511. 

On the other hand, similar processing will also be 
performed when calculating the second spectrum . A second 
signal input from input terminal 502 is given to LPC 

20 analysis section 508 and performed an LPC analysis to 
obtain LPC coefficients. The LPC coefficients obtained 
here are converted to parameters which are suitable for 
coding such as LSP coefficients, then coded and an index 
thereof is given to multiplexing section 521. 

25 Simultaneously, the LPC coefficients are decoded and the 
decoded LPC coefficients are given to LPC spectrum 

25 



calculation section 509. LPC spectrum calculation 
section 509 has a function similar to that of above 
described LPC spectrum calculation section 506 and 
calculates spectrum envelope e2 (k) for the second signal 
5 according to Expression (14) or Expression (15). 
Spectral tilt correction section 510 has a function 
similar to that of above described spectral tilt 
correction section 507 and corrects the spectral tilt 
which is present in the second spectrum according to the 
10 following Expression (17) . 

S2rtew(k) = ^-...(H) 

The corrected second spectrum obtained in this way 
is given to search section 513 and at the same time given 
to spectral tilt assignment section 519. 

15 Spectral tilt assignment section 519 assigns a 

spectral tilt to estimated value D2(k) of the second 
spectrum given from search section 513 according to the 
following Expression (18) . 

D2new(k) = D2(k) • e2(k) ...(18) 

20 Estimated value s2new(k) of the second spectrum 

calculated in this way is given to spectral outline 
adjustment coefficient coding section 520. 

Multiplexing section 521 multiplexes information 
about pitch coefficient Tmax given from search section 

25 513, information about an adjustment coefficient given 
from spectral outline adjustment coefficient coding 
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section 520 and coding information about the LPC 
coefficients given from the LPC analysis section, and 
outputs the multiplexing result from output terminal 522 . 



5 (Embodiment 6) 

FIG. 15 is a block diagram showing the configuration 
of spectrum coding apparatus 600 according to Embodiment 
6 of the present invention. Features of this embodiment 
include detecting a band in which the shape of a spectrum 

10 is relatively flat from within first spectrum Sl(k) and 
searching pitch coefficient T from this flat band. This 
makes it less likely that the energy of the spectrum after 
substitution may become discontinuous and produces the 
effect of avoiding the problem of discontinuity of 

15 spectral energy. In FIG. 15, components having the same 
names as those in FIG. 13 have identical functions, and 
therefore detailed explanations of such components will 
be omitted. Furthermore, this embodiment will explain 
a case where a technique of correcting spectral tilts 

20 is applied to aforementioned Embodiment 4, but this 
technique is not limited to this and is also applicable 
to each of the aforementioned embodiments. 

First spectrum Sl(k) is given to spectral flat part 
detection section 605 from frequency domain 

25 transformation section 603 and a band in which the spectrum 
has the flat shape is detected from first spectrum SI (k) . 
Spectral flat part detection section 605 divides first 
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spectrum Sl(k) in band 0^ k<FL into a plurality of subbands, 
quantifies the amount of spectral variation of each 
subband and detects a subband with the smallest amount 
of spectral variation. The information indicating the 
subband is given to pitch coefficient setting section 
609 and multiplexing section 615. 

This embodiment will explain a case where a variance 
of a spectrum included in a subband is used as means for 
quantifying the amount of spectral variation. Band 0 
±£k<FL is divided into N subbands and variance u(n) of 
spectrum Sl(k) included in each subband is calculated 
according to the following Expression (19). 



Here, BL(n) denotes a minimum frequency of an nth 
subband, BH(n) denotes a maximum frequency of the nth 
subband, Slmean denotes an average of the absolute value 
of the spectrum included in the nth subband. Here, the 
absolute value of the spectrum is taken because it is 
intended to detect a flat band from the standpoint of 
the amplitude value of the spectrum. 

Variances u(n) of the respective subbands obtained 
in this way are compared, a subband with the smallest 
variance is determined and variable n indicating the 
subband is given to pitch coefficient setting section 
609 and multiplexing section 615. 




«(«)= 



k=BL(n) 



•••(19) 



BH(n) + BL(n) + l 
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Pitch coefficient setting section 609 limits the 
search range of pitch coefficient T into the band of the 
subband determined by spectral flat part detection 
section 605 and determines a candidate of pitch 
5 coefficient T within the limited range. Because pitch 
coefficient T is determined from within the band where 
the variation of spectral energy is small in this way, 
the problem of discontinuity of spectral energy is reduced 
Multiplexing section 615 multiplexes information about 

10 pitch coefficient Tmax given from search section 608, 
information about an adjustment coefficient given from 
spectral outline adjustment coefficient coding section 
614 and information about a subband given from spectral 
flat part detection section 605, and outputs the 

15 multiplexing result from output terminal 616. 

( Embodiment 7 ) 

FIG. 16 is a block diagram showing the configuration 
of spectrum coding apparatus 700 according to Embodiment 

20 7 of the present invention. A feature of this embodiment 
includes adaptively changing the range for searching 
pitch coefficient T according to the degree of periodicity 
of an input s ignal . Inthisway, since no harmonic structure 
exists for a less periodic signal such as a silence part, 

25 problems are less likely to occur even when the search 
range is set to be very small. Furthermore, for a more 
periodic signal such as a voiced sound part, the range 
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for searching pitch coefficient T is changed according 
to the value of the pitch period at that time. This makes 
it possible to reduce the amount of information for 
expressing pitch coefficient T and reduce the bit rate. 
5 In FIG. 16 components having the same names as those in 
FIG. 13 have identical functions and therefore detailed 
explanations of such components will be omitted. 
Furthermore, this embodiment will explain a case where 
this technique is applied to above described Embodiment 

10 4, but this technique is not limited to this and is also 
applicable to each of the embodiments described so far. 

At least one of a parameter indicating the degree 
of the pitch periodicity and a parameter indicating the 
length of the pitch period is input from input terminal 

15 706. This embodiment will explain a case where a 
parameter indicating the degree of the pitch periodicity 
and a parameter indicating the length with pitch period 
are input. Furthermore, this embodiment will be 
explained assuming that pitch period P and pitch gain 

20 Pg obtained by an adaptive codebook search by CELP (not 
shown) are input from input terminal 706. 

Search range determining section 707 determines a 
search range using pitch period P and pitch gain Pg given 
from input terminal 706. First, search range determining 

25 section 707 judges the degree of the periodicity of the 
input signal based on the magnitude of pitch gain Pg . 
When pitch gain Pg is larger than a threshold, the input 
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signal input from input terminal 701 is regarded as a 
voiced sound part and TMIN and TMAX indicating the search 
range of pitch coefficient T are determined so as to include 
at least one harmonic of the harmonic structure expressed 
5 by pitch period P. Therefore, when the frequency of pitch 
period P is large, the search range of pitch coefficient 
T is set to be wide, and on the contrary when the frequency 
of pitch period P is small, the search range of pitch 
coefficient T is set to be narrow. 
10 When pitch gain Pg is smaller than the threshold, 

the input signal input from input terminal 701 is assumed 
to be a silence part and no harmonic structure is assumed 
to exist, and therefore the search range for searching 
pitch coefficient T is set to be very narrow. 

15 

( Embodiment 8 ) 

FIG. 17 is a block diagram showing the configuration 

of hierarchical coding apparatus 800 according to 

Embodiment 8 of the present invention. This embodiment 
20 applies any one of above described Embodiments 1 to 7 

to hierarchical coding, and can thereby code a voice signal 

or audio signal at a low bit rate 

Acoustic data is input from input terminal 801 and 

a low sampling rate signal is generated by downsampling 
25 section 802. The downsampled signal is given to first 

layer coding section 803 and the relevant signal is coded. 

The code of first layer coding section 803 is given to 
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multiplexing section 807 and is also given to first layer 
decoding section 804. First layer decoding section 804 
generates a first layer decoded signal based on the code. 

Next, upsampling section 805 raises the sampling 
5 rate of the decoded signal of first layer coding section 
803. Delay section 806 gives a delay of a specific length 
to the input signal input from input terminal 801. The 
magnitude of this delay is set to the same value as the 
time delay produced by downsampling section 802, first 

10 layer coding section 803, first layer decoding section 
804 and upsampling section 805. 

Any one of above described Embodiments 1 to 7 is 
applied to spectrum coding section 101, spectrum coding 
is performed using the signal obtained from upsampling 

15 section 805 as a first signal and the signal obtained 
from delay section 806 as a second signal and the codes 
are output to multiplexing section 807. 

The code obtained from first layer coding section 
803 and the code obtained from spectrum coding section 

20 101 are multiplexed by multiplexing section 807 and are 
output from output terminal 808 as the output code. 

When the configuration of spectrum coding section 
101 is the one shown in FIG. 14 and FIG. 16, the configuration 
of hierarchical coding apparatus 800a according to this 

25 embodiment (lowercase alphabet is appended to distinguish 
it from hierarchical coding apparatus 800 shown in FIG. 17) 
is as shown in FIG. 18. The difference between FIG. 18 and 
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FIG. 17 is that a signal line which is directly input from 
first layer decoding section 804a is added to spectral 
coding section 101. This shows that the LPC coefficients 
decoded by first layer decoding section 804 or pitch period 
P and pitch gain Pg are given to spectral coding section 
101 . 

(Embodiment 9) 

FIG. 19 is a block diagram showing the configuration 
of spectrum decoding apparatus 1000 according to 
Embodiment 9 of the present invention. 

In this embodiment, it is possible to estimate the 
high-frequency component of a second spectrum by a filter 
based on a first spectrum and decode a generated code, 
thereby decode an accurately estimated spectrum, adjust 
a spectral outline of the estimated spectrum of the 
high-frequency band with an appropriate subband and 
thereby achieve the effect of improving the quality of 
the decoded signal. The code coded by a spectrum coding 
section (not shown here) is input from input terminal 
1002 and is given to separation section 1003. Separation 
section 1003 gives information about a filter coefficient 
to filtering section 1007 and spectral outline adjustment 
subband determining section 1008. At the same time, it 
gives information about a spectral outline adjustment 
coefficient to spectral outline adjustment coefficient 
decoding section 1009. 
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Moreover, a first signal whose effective frequency 
band is 0^k<FL is input from input terminal 1004 and 
frequency domain transformation section 1005 performs 
a frequency transformation on a time domain signal input 
5 from input terminal 1004 and calculates first spectrum 
Sl(k) . Here, as the frequency transformation method, 
a discrete Fourier transform (DFT), discrete cosine 
transform (DCT), modified discrete cosine transform 
(MDCT) and so on can be used. 

10 Next, internal state setting section 1006 sets the 

internal state of a filter used at filtering section 1007 
using first spectrum Sl(k) . Filtering section 1007 
performs filtering based on the internal state of the 
filter set by internal state setting section 1006, pitch 

15 coefficient Tmax given from separation section 1003 and 
filter coefficient (3 and calculates estimated value D2 (k) 
of the second spectrum. In this case, at filtering 
section 1007, the filter described in Expression (1) is 
used. Furthermore, when the filter described in 

20 Expression (12) is used, it is only pitch coefficient 
Tmax that is given from separation section 1003. Which 
filter should be used corresponds to the type of the filter 
used by the spectrum coding section (not shown here) and 
the filter identical to that filter is used. 

25 The state of decoded spectrum D(k) generated from 

filtering section 1007 is shown in FIG. 20. As shown in 
FIG. 20, decoding spectrum D(k) consists of first spectrum 
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SI (k) in frequency band 0^k<FL and estimated value D2 ( k) 
of the second spectrum in frequency band FL^k<FH. 

Spectral outline adjustment subband determining 
section 1008 determines the subband for adjusting a 
5 spectral outline using pitch coefficient Tmax given from 
separation section 1003. A jth subband can be expressed 
as shown in the following Expression (20) using pitch 
coefficient Tmax . 

V } max (0< j<j)... (20) 
1 BH (j) = FL + j - T max y J } 

10 Here, BL(j) denotes a minimum frequency of the jth 

subband and BH ( j ) denotes a maximum frequency of the jth 
subband. Furthermore, the number of subbands J is 
expressed as a minimum integer corresponding to maximum 
frequency BH(J-l) of the (J-l)th subband that exceeds 

15 FH . The information about the spectral outline 

adjustment subband determined in this way is given to 
spectrum adjustment section 1010. 

Spectral outline adjustment coefficient decoding 
section 1009 decodes a spectral outline adjustment 

20 coefficient based on the information about the spectral 
outline adjustment coefficient given from separation 
section 1003 and gives this decoded spectral outline 
adjustment coefficient to spectrum adjustment section 
1010. Here, the spectral outline adjustment coefficient 

25 quantizes the amount of variation for each subband 
expressed by Expression (8) and t hen expre s se s the de coded 
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value Vq ( j ) . 

Spe ct rum ad j us tment section 1010 mul t ipl ie s decoded 
spectrum D(k) obtained from filtering section 1007 by 
decoded value Vq(j) of the amount of variation for each 
subband decoded by spectral outline adjustment 
coefficient decoding section 1009 on the subband given 
from spectral outline adjustment subband determining 
section 1008 according to the following Expression (21) , 
thereby adjusts the spectral shape of frequency band FL 
^ k<FH of decoded spectrum D(k) and generates decoded 
spectrum S3(k) after adjustment. 

S3(k) = D{k) V q {j) (BL(j)< k < BH{j\forall j)... (21) 

This decoded spectrum S3(k) is given to time domain 
conversion section 1011, converted to a time domain signal 
and output from output terminal 1012. When converting 
decoded spectrum S3 ( k ) to a time domain signal , time domain 
conversion section 1011 performs appropriate processing 
such as windowing and overlap-add as required and avoids 
discontinuity which occurs among frames . 

(Embodiment 10) 

FIG. 21 is a block diagram showing the configuration 
of spectrum decoding apparatus 1100 according to 
Embodiment 10 of the present invention. A feature of this 
embodiment includes dividing a band of FL^k<FH into a 
plurality of subbands beforehand so that a spectrum can 
be decoded using information about each subband. This 
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avoids the problem of discontinuity of spectral energy 
caused by spectral tilts included in the spectrum in a 
band of 0^Sk<FL which is the substitution source. In 
addition, it is possible to decode a code which is coded 
5 for each subband independently and generate a high quality 
decoded signal. In FIG. 21, components having the same 
names as those in FIG. 19 have identical functions, and 
therefore detailed explanations of such components will 
be omitted. 

10 In this embodiment, band FL^k<FH is divided into 

predetermined J subbands as shown in FIG. 12, and pitch 
coefficient Tmax, filter coefficient |3 and spectral 
outline adjustment coefficient Vq which are coded for 
each subband are decoded to generate a voice signal. Or 

15 pitch coefficient Tmax and spectral outline adjustment 
coefficient Vq which are coded for each subband are decoded 
to generate a voice signal. Which technique should be 
adopted depends on the kind of the filter used at the 
spectral coding section (not shown here) . The filter in 

20 Expression (1) is used in the former case and the filter 
in Expression (12) is used in the latter case. 

First spectrum SI (k) is stored in band 0^k<FL from 
spectrum adj ustment section 1108 and as for band FL^ k< FH , 
the spectrum after spectral outline adjustment which has 

25 been divided into J subbands is given to subband 
integration section 1109. Subband integration section 
1109 combines these spectra and generates decoded 
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spectrum D(k) as shown in FIG. 20. Decoding spectrum D ( k) 
generated in this way is given to time domain conversion 
section 1110. The flow chart of this embodiment is shown 
in FIG . 22 . 

5 

(Embodiment 11) 

FIG. 23 is a block diagram showing the configuration 
of spectrum decoding apparatus 1200 according to 
Embodiment 11 of the present invention. Features of this 

10 embodiment include correcting spectral tilts of first 
spectrum Sl(k) and second spectrum S2(k) using an LPC 
spectrum respectively and decoding a code that can be 
obtained by calculating estimated value D2(k) of the 
second spectrum using the corrected spectra. This makes 

15 it possible to obtain a spectrum free of the problem of 
discontinuity of spectral energy and produces the effect 
of generating a high quality decoded signal. In FIG. 23, 
components having the same names as those in FIG. 21 have 
identical functions, and therefore detailed explanations 

20 of such components will be omitted. Furthermore, this 
embodiment will explain a case where a technique of 
correcting spectral tilts is applied to above described 
Embodiment 10, but this technique is not limited to this 
and is also applicable to above described Embodiment 9. 

25 LPC coefficient decoding section 1210 decodes LPC 

coefficients based on information about the LPC 
coefficients given from separation section 12 02 and gives 
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the LPC coefficients to LPC spectrum calculation section 
12 11 . The processing by LPC coefficient decoding section 
1210 depends on the coding processing on the LPC 
coefficients which is performed inside the LPC analysis 
5 section of a coding sect ion ( no t shown he re ) and proces s ing 
of decoding the code obtained through the coding 
processing there is performed. LPC spectrum calculation 
section 1211 calculates the LPC spectrum according to 
Expression (14) or Expression (15) . The same method as 

10 that used by the LPC spectrum calculation section of the 
coding section (not shown here) can be used to determine 
which method should be used. The LPC spectrum calculated 
by LPC spectrum calculation section 1211 is given to 
spectral tilt assignment section 1209. 

15 On the other hand, the LPC coefficients calculated 

by the LPC decoding section (not shown here) or the LPC 
calculation section is input from input terminal 1215 
and is given to LPC spectrum calculation section 1216. 
LPC spectrum calculation section 1216 calculates the LPC 

20 spectrum according to Expression (14) or Expression (15) . 
Which expression should be used depends on what method 
is used by the coding section (not shown here) . 

Spectral tilt assignment section 1209 multiplies 
decoded spectrum D(k) given from filtering section 1206 

25 by the spectral tilt according to the following Expression 
(22), and then gives decoded spectrum D(k) assigned a 
spectral tilt to spectrum adjustment section 1207. In 



Expression (22) , el (k) denotes the output of LPC spectrum 
calculation section 1216 and e2(k) denotes the output 
of LPC spectrum calculation section 1211. 

D2new(k) = • el{k) ...(22) 

5 

(Embodiment 12) 

FIG. 24 is a block diagram showing the configuration 
of spectrum decoding apparatus 1300 according to 
Embodiment 12 of the present invention. Feature of this 

10 embodiment include detecting a band in which the spectrum 
has a relatively flat shape from within first spectrum 
Sl(k) and decoding a code obtained by searching pitch 
coefficient T from this flat band. 

This prevents the energy of the spectrum after 

15 substitution from becoming discontinuous, can obtain a 
decoded spectrum free of the problem of discontinuity 
of spectral energy and produce the effect of generating 
a high quality decoded signal. In FIG. 24, components 
having the same names as those in FIG. 21 have identical 

20 functions, and therefore detailed explanations of such 
components will be omitted. Furthermore, this 

embodiment will explain a case where this technique is 
applied to above described Embodiment 10, but this 
technique is not limited to this and is also applicable 

25 to above described Embodiment 9 and Embodiment 11. 

Separation section 1302 gives subband selection 
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information n indicating which subband is selected out 
of the N subbands into which band 0^k<FL is divided and 
information indicating which position is used as the start 
point of the substitution source out of the frequencies 
included in the nth subband to pitch coefficient Tmax 
generation section 1303. Pitch coefficient Tmax 

generation section 1303 generates pitch coefficient Tmax 
used at filtering section 1307 based on these two pieces 
of information and gives pitch coefficient Tmax to 
filtering section 1307. 

(Embodiment 13) 

FIG. 25 is a block diagram showing the configuration 
of hierarchical decoding apparatus 1400 according to 
15 Embodiment 13 of the present invention. This embodiment 
applies any one of above described Embodiments 9 to 12 
to a hierarchical decoding method, and can thereby decode 
a code generated by the hierarchical coding method of 
above described Embodiment 8 and decode a high quality 
20 voice signal or audio signal. A code that is coded using 
a hierarchy signal coding method (not shown here) is input 
from input terminal 1401, separation section 1402 
separates the above described code and generates a code 
for the first layer decoding section and a code for the 
25 spectrum decoding section. First layer decoding section 
1403 decodes the decoded signal of sampling rate 2 • FL 
using the code obtained at separation section 1402 and 
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gives the decoded signal to upsampling section 1405. 
Upsampling section 1405 raises the sampling frequency 
of the first layer decoded signal given from first layer 
decoding section 1403 to 2 • FH . According to this 
5 configuration, when the first layer decoded signal 
generated by first layer decoding section 1403 needs to 
be output, the first layer decoded signal can be output 
from output terminal 1404. When the first layer decoded 
signal is not necessary, output terminal 1404 can be 

10 deleted from the configuration. 

The code separated by separation section 1402 and 
first layer decoded signal after upsampling generated 
by upsampling section 1405 are given to spectrum decoding 
section 1001. Spectrum decoding section 1001 performs 

15 spectrum decoding based on one of the methods according 
to above described Embodiments 9 to 12, generates a decoded 
signal of sampling frequency 2 • FH and outputs the signal 
from output terminal 1406. Spectrum decoding section 
1001 performs processing assuming the first layer decoded 

20 signal after the upsampling given from upsampling section 
1405 as a first signal. 

When the configuration of spectrum decoding section 
1001 is the one shown in FIG. 23, the configuration of 
hierarchical decoding apparatus 1400a according to this 

25 embodiment is as shown in FIG. 26. The difference between 
FIG. 25 and FIG. 26 is in that the signal line directly 
input from separation section 1402 is added to spectrum 
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decoding section 1001. This shows that the LPC 
coefficients decoded by separation section 1402 or pitch 
period P and pitch gain Pg are given to spectrum decoding 
section 1001 . 

5 

(Embodiment 14) 

Next, Embodiment 14 of the present invention will 
be explained with reference to drawings. FIG. 27 is a 
block diagram showing the configuration of acoustic 

10 signal coding apparatus 1500 according to Embodiment 14 
of the present invention. This embodiment is 

characterized in that acoustic coding apparatus 1504 in 
FIG. 27 is constructed of hierarchical coding apparatus 
800 shown in above described Embodiment 8. 

15 As shown in FIG. 27, acoustic signal coding apparatus 

1500 according to Embodiment 14 of the present invention 
is provided with input apparatus 1502, A/D conversion 
apparatus 1503 and acoustic coding apparatus 1504 which 
is connected to network 1505. 

20 The input terminal of A/D conversion apparatus 1503 

is connected to the output terminal of input apparatus 
1502. The input terminal of acoustic coding apparatus 
1504 is connected to the output terminal of A/D conversion 
apparatus 1503. The output terminal of acoustic coding 

25 apparatus 1504 is connected to network 1505. Input 
apparatus 1502 converts sound wave 1501 which is audible 
to human ears to an analog signal which is an electric 



signal and gives it to A/D conversion apparatus 1503. 
A/D conversion apparatus 1503 converts an analog signal 
to a digital signal and gives it to acoustic coding 
apparatus 1504. Acoustic coding apparatus 1504 codes an 
5 input digital signal, generates a code and outputs it 
to network 1505. 

According to Embodiment 14 of the present invention, 
it is possible to obtain the effect as shown in above 
described Embodiment 8 and provide an acoustic coding 
10 apparatus which codes an acoustic signal efficiently. 

(Embodiment 15) 

Next, Embodiment 15 of the present invention will 
be explained with reference to drawings. FIG. 28 is a 

15 block diagram showing the configuration of acoustic 
signal decoding apparatus 1600 according to Embodiment 
15 of the present invention. This embodiment is 
characterized in that acoustic decoding apparatus 1603 
shown in FIG. 28 is constructed of hierarchical decoding 

20 apparatus 1400 shown in above described Embodiment 13. 

As shown in FIG. 28, acoustic signal decoding 
apparatus 1600 according to Embodiment 15 of the present 
invention is provided with reception apparatus 1602 which 
is connected to network 1601, acoustic decoding apparatus 

25 1603, D/A conversion apparatus 1604 and output apparatus 
1605 . 

The input terminal of reception apparatus 1602 is 
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connected to network 1601. The input terminal of acoustic 
decoding apparatus 1603 is connected to the output 
terminal of reception apparatus 1602 . The input terminal 
of D/A conversion apparatus 1604 is connected to the output 
5 terminal of voice decoding apparatus 1603. The input 
terminal of output apparatus 1605 is connected to the 
output terminal of D/A conversion apparatus 1604. 

Reception apparatus 1602 receives a digital coded 
acoustic signal from network 1601, generates a digital 

10 reception acoustic signal and gives it to acoustic 
decoding apparatus 1603. Voice decoding apparatus 1603 
receives a reception acoustic signal from reception 
apparatus 1602, performs decoding processing on this 
reception acoustic signal, generates a digital decoded 

15 acoustic signal and gives it to D/A conversion apparatus 
1604 . D/A conversion apparatus 1604 converts the digital 
decoded voice signal from acoustic decoding apparatus 
1603, generates an analog decoded voice signal and gives 
it to output apparatus 1605. Output apparatus 1605 

20 converts the analog decoded acoustic signal which is an 
electric signal to vibration of the air and outputs it 
as sound wave 1606 audible to human ears. 

According to Embodiment 15 of the present invention, 
it is possible to obtain the effect as shown in above 

25 described Embodiment 13 and efficiently perform decoding 
the coded acoustic signal with a small number of bits 
and thereby output a high quality acoustic signal. 



(Embodiment 16) 

Next, Embodiment 16 of the present invention will 
be explained with reference to drawings. FIG. 29 is a 
5 block diagram showing the configuration of acoustic 
signal transmission coding apparatus 1700 according to 
Embodiment 16 of the present invention. Embodiment 16 
of the present invention is characterized in that acoustic 
coding apparatus 1704 in FIG. 29 is constructed of 

10 hierarchical coding apparatus 800 shown in above 
described Embodiment 8 . 

As shown in FIG. 29, Acoustic signal transmission 
coding apparatus 1700 according to Embodiment 16 of the 
present invention is provided with input apparatus 1702, 

15 A/D conversion apparatus 1703, acoustic coding apparatus 
1704, RF modulation apparatus 1705 and antenna 1706. 

Input apparatus 1702 converts sound wave 1701 which 
is audible to human ears to an analog signal which is 
an electric signal and gives it to A/D conversion apparatus 

20 1703. A/D conversion apparatus 1703 converts an analog 
signal to a digital signal and gives it to acoustic coding 
apparatus 1704 . Acoustic coding apparatus 1704 codes the 
input digital signal, generates a coded acoustic signal 
and gives it to RF modulation apparatus 1705. RF 

25 modulation apparatus 1705 modulates the coded acoustic 
signal, generates a modulated coded acoustic signal and 
gives it to antenna 1706. Antenna 1706 transmits the 
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modulated coded acoustic signal as radio wave 1707. 

According to Embodiment 16 of the present invention, 
it is possible to obtain the effect as shown in above 
described Embodiment 8 and efficiently code the acoustic 
5 signal with a small number of bits. 

The present invention can be applied to a 
transmission apparatus, transmission coding apparatus 
or acoustic signal coding apparatus that uses an audio 
signal. Furthermore, the present invention can also be 
10 applied to a mobile station apparatus or base station 
apparatus . 

(Embodiment 17) 

Next, Embodiment 17 of the present invention will 

15 be explained with reference to drawings. FIG. 30 is a 
block diagram showing the configuration of acoustic 
signal reception decoding apparatus 1800 according to 
Embodiment 17 of the present invention. Embodiment 17 
of the present invention is characterized in that acoustic 

20 decoding apparatus 1804 in FIG. 30 is constructed of 
hierarchical decoding apparatus 1400 shown in above 
described Embodiment 13. 

As shown in FIG. 30, acoustic signal reception 
decoding apparatus 1800 according to Embodiment 17 of 

25 the present invention is provided with antenna 1802, RF 
demodulation apparatus 1803, acoustic decoding apparatus 
1804, D/A conversion apparatus 1805 and output apparatus 
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1806 . 

Antenna 1802 receives a digital coded acoustic 
signal as radio wave 1801, generates a digital reception 
coded acoustic signal which is an electric signal and 
5 gives it to RF demodulation apparatus 1803. RF 
demodulation apparatus 1803 demodulates the reception 
coded acoustic signal from antenna 1802, generates a 
demodulated coded acoustic signal and gives it to acoustic 
decoding apparatus 1804. 

10 Acoustic decoding apparatus 1804 receives a digital 

demodulated coded acoustic signal from RF demodulation 
apparatus 1803, performs decoding processing, generates 
a digital decoded acoustic signal and gives it to D/A 
conversion apparatus 1805. D/A conversion apparatus 

15 1805 converts the digital decoded voice signal from 
acoustic decoding apparatus 1804, generates an analog 
decoded voice signal and gives it to output apparatus 
18 06. Output apparatus 1806 converts the analog decoded 
voice signal which is an electric signal to vibration 

20 of the air and outputs it as sound wave 1807 audible to 
human ears . 

According to the Embodiment 17 of the present 
invention, it is possible to obtain the effect as shown 
in above described Embodiment 13, decode a coded acoustic 
25 signal efficiently with a small number of bits and thereby 
output a high quality acoustic signal. 

As explained above, according to the present 
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invention, by estimating a high-frequency band of a second 
spectrum using a filter having a first spectrum as its 
internal state, coding a filter coefficient when the 
degree of similarity to the estimated value of the second 
5 spectrum becomes a maximum and adjusting a spectral 
outline with an appropriate subband, it is possible to 
code the spectrum at a low bit rate and with high quality. 
Moreover, by applying the present invention to 
hierarchical coding, a voice signal and audio signal can 

10 be coded at a low bit rate and with high quality. 

The present invention can be applied to a reception 
apparatus, reception decoding apparatus or voice signal 
decoding apparatus using an audio signal. Furthermore, 
the present invention can also be applied to a mobile 

15 station apparatus or base station apparatus. 

Furthermore, each function block employed in the 
description of each of the aforementioned embodiments 
may typically be implemented as an LSI constituted by 
an integrated circuit. These may be individual chips 

20 or partially or totally contained on a single chip. 

Furthermore, LSI is adopted here, but this may also 
be referred to as "IC", "system LSI", "super LSI" or "ultra 
LSI" depending on the differing extents of integration. 

Further, the method of circuit integration is not 

25 limited to LSI ' s, and implementation using dedicated 
circuitry or general purpose processors is also possible. 
After LSI manufacture, utilization of an FPGA (Field 
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Programmable Gate Array) or a reconf igurable processor 
where connections and settings of circuit cells within 
an LSI can be reconfigured is also possible. 

Further, if integrated circuit technology comes out 
5 to replace LSI's as a result of the advancement of 
semiconductor technology or a derivative other technology, 
it is naturally also possible to carry out function block 
integration using this technology. The adaptation of a 
biotechnology and so on may be considered as 

10 possibilities. 

A first mode of the spectrum coding method of the 
present invention is a spectrum coding method comprising 
a section for performing the frequency transformation 
of a first signal and calculating a first spectrum, a 

15 section for performing the frequency transformation of 
a second signal and calculating a second spectrum, a step 
of estimating the shape of the second spectrum in a band 
of FL^k<FH using a filter which has the first spectrum 
in a band of 0^k<FL as an internal state and a step of 

20 coding a coefficient indicating the filter characteristic 
at this time, wherein the outline of the second spectrum 
determined based on the coefficient indicating the filter 
characteristic is coded together. 

According to this configuration, it is only 

25 necessary to code the coefficient indicating the 
characteristic of the filter by estimating the 
high-frequency component of second spectrum S2(k) using 



t 
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the filter based on first spectrum Sl(k) and it is possible 
to estimate the high-frequency component of second 
spectrum S2(k) at a low bit rate and with high accuracy. 

Moreover, since a spectral outline is coded based 
5 on the coefficient indicating the characteristic of the 
filter, no discontinuity of energy of the spectrum occurs 
and thereby it is possible to improve quality. 

Furthermore, a second mode of the spectrum coding 
method of the present invention divides the second 

10 spectrum into a plurality of subbands and codes the 
coefficient indicating the characteristic of the filter 
and the outline of the spectrum for each subband. 

According to this configuration, by estimating the 
high-frequency component of second spectrum S2(k) using 

15 the filter based on first spectrum Sl(k), it is only 
necessary to code the coefficient indicating the 
characteristic of the filter and estimate the 
high-frequency component of second spectrum S2(k) at a 
low bit rate and with high accuracy. Furthermore, a 

20 plurality of subbands are predetermined and the 
coefficient indicating the filter characteristic and the 
outline of the filter are coded for each subband, and 
therefore it is possible to prevent discontinuity of 
energy of the spectrum and thereby improve quality. 

25 Furthermore, a third mode of the spectrum coding 

method of the present invention adopts the above described 
configuration in which the filter can be expressed by 
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P(z) = xj— ...(23) 

i=-M 

and estimation is performed using a zero-input response 
of the filter. 

According to this configuration, it is possible to 
5 prevent collapse of the harmonic structure caused with 
the estimated value of S2(k) and obtain the effect of 
improving qual i ty . 

Moreover, a fourthmode of the spectrum coding me thod 
of the present invention adopts the above described 
10 configuration in which M=0, (3o=l are assumed. 

According to this configuration, the characteristic 
of the filter is determined only by pitch coefficient 
T and it is possible to obtain the effect that the spectrum 
can be estimated at a low bit rate. 
15 Furthermore, a fifth mode of the spectrum coding 

method of the present invention adopts the above described 
configuration in which the outline of the spectrum is 
determined for each subband determined by pitch 
coefficient T . 

20 According to this configuration, since the band 

width of the subband is determined appropriately, it is 
possible to prevent discontinuity of energy of the 
spectrum and improve quality. 

Furthermore, a sixth mode of the spectrum coding 

25 method of the present invention adopts the above described 
configuration, in which the first signal is a signal coded 



and then decoded in a lower layer or a signal obtained 
by upsampling this signal and the second signal is an 
input signal . 

According to this configuration, it is possible to 
5 apply the present invention to hierarchical coding which 
is composed of a coding section with a plurality of layers 
and obtain the effect that an input signal can be coded 
at a low bit rate and with high quality. 

A first mode of the spectrum decoding method of the 

10 present invention is a spectrum decoding method 
comprising the steps of decoding a coefficient indicating 
the characteristic of a filter, performing the frequency 
transformation of a first signal to obtain a first spectrum 
and generating an estimated value of a second spectrum 

15 in a band of FL^£k<FH using the filter which has the first 
spectrum in a band of 0^k<FL as the internal state, in 
which the spectral outline of the second spectrum 
determined based on the coefficient indicating the 
characteristic of the filter is decoded together. 

20 According to this configuration, it is possible to 

decode the code obtained by estimating the high-frequency 
component of second spectrum S2 (k) using the filter based 
on first spectrum Sl(k) and thereby obtain the effect 
that the estimated value of the high-frequency component 

25 ofsecond spectrum S2(k) canbedecodedwithhighaccuracy. 
Furthermore, since the spectral outline coded based on 
the coefficient indicating the characteristic of the 
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filter can be decoded, discontinuity of energy of the 
spectrum no longer occurs andahigh quality decoded signal 
can be generated. 

Furthermore, a second mode of the spectrum decoding 
5 method of the present invention comprises the steps of 
dividing the second spectrum into a plurality of subbands 
and decoding a coefficient indicating the characteristic 
of the filter and the outline of the spectrum for each 
subband . 

10 According to this configuration, it is possible to 

decode the code which is coded by estimating the 
high-frequency component of second spectrum S2(k) using 
the filter based on first spectrum Sl(k) and thereby obtain 
the effect that the estimated value of the high-frequency 

15 component of second spectrum S2(k) can be decoded with 
high accuracy. Furthermore, it is possible to 

predetermine a plurality of subbands and decode the 
coefficient indicating the characteristic of the filter 
coded and outline of the spectrum for each subband, and 

20 thereby discontinuity of energy of the spectrum is 
prevented and a high quality decoded signal can be 
generated . 

Moreover, a third mode of the spectrum decoding 
method of the present invention adopts the above described 
25 configuration in which the filter is expressed 

P i z ) = TT -(23) 

i- 2>,*- r+f 
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and an estimated value is generated using a zero-input 
response of the filter. 

According to this configuration, it is possible to 
decode a code that is coded using the method of preventing 
5 collapse of the harmonic structure caused with the 
estimated value of S2(k) and thereby obtain the effect 
that decodes the estimated value of the spectrum with 
improved quality. 

Moreover, a fourth mode of the spectrum decoding 
10 method of the present invention adopts the above described 
configuration in which M=0, (3 0 =1 are assumed. 

According to this configuration, since it is 
possible to decode a code that is coded by estimating 
the spectrum based on the filter whose characteristic 
15 is defined only by pitch coefficient T and thereby obtain 
the effect that the estimated value of the spectrum can 
be decoded at a low bit rate. 

Furthermore, a fifth mode of the spectrum decoding 
method of the present invention has a configuration in 
20 which the outline of the spectrum is decoded for each 
subband determined by pitch coefficient T. 

According to this configuration, the spectral 
outline calculated for each subband having an appropriate 
bandwidth can be decoded, and therefore it is possible 
25 to prevent discontinuity of energy of the spectrum and 
improve quality. 

Furthermore, a sixth mode of the spectrum decoding 
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method of the present invention adopts the above described 
configuration in which the first signal is generated from 
a signal decoded in a lower layer or a signal obtained 
by upsampling this signal. 
5 According to this configuration, it is possible to 

decode a code that is coded through hierarchical coding 
made up of a coding section with a plurality of layers 
and thereby obtain the effect that a decoded signal can 
be obtained at a low bit rate and with high quality. 

10 The acoustic signal transmission apparatus of the 

present invention adopts a configuration comprising an 
acoustic input apparatus that converts an acoustic signal 
such as a music sound and voice to an electric signal, 
an A/D conversion apparatus that converts a signal output 

15 from an acoustic input section to a digital signal, a 
coding apparatus that performs coding using a method 
including one spectral coding scheme according to one 
of claims 1 to 6 which performs coding on the digital 
signal output from this A/D conversion apparatus, an RF 

20 modulation apparatus that performs modulation processing 
or the like on the code output from this acoustic coding 
apparatus and a transmission antenna that converts a 
signal output from this RF modulation apparatus to a radio 
wave and transmits the signal. 

25 According to this configuration, it is possible to 

provide a coding apparatus that performs coding 
efficiently with a small number of bits. 



The acoustic signal decoding apparatus of the 
present invention adopts a configuration including a 
reception antenna that receives a reception radio wave, 
an RF demodulation apparatus that performs demodulation 
5 processing on the signal received from the reception 
antenna, a decoding apparatus that performs decoding 
processing on information obtained by the RF demodulation 
apparatus using the method including one spectrum 
decoding method according to claims 7 to 12, a D/A 

10 conversion apparatus that D/A-converts the digital 
acoustic signal decoded by the acoustic decoding 
apparatus and an acoustic output apparatus that converts 
an electric signal output from the D/A conversion 
apparatus to an acoustic signal. 

15 According to this configuration, it is possible to 

decode a coded acoustic signal efficiently with a small 
number of bits and thereby output a high quality 
hierarchical signal. 

The communication terminal apparatus of the present 

20 invention adopts a configuration comprising at least one 
of the above described acoustic signal transmission 
apparatuses or above described acoustic signal reception 
apparatuses. The base station apparatus of the present 
invention adopts a configuration comprising at least one 

25 of the above described acoustic signal transmission 
apparatuses or above described acoustic signal reception 
apparatuses . 



According to this configuration, it is possible to 
provide a communication terminal apparatus or a base 
station apparatus that codes an acoustic signal 
elficiently with a small number of bits. Furthermore, 
5 this configuration can also provide a communication 
terminal apparatus or base station apparatus capable of 
decoding a coded acoustic signal efficiently with a small 
number of bits . 

This application is based on Japanese Patent 
10 Application No . 2 0 0 3 - 3 6 3 0 8 0 filed on October 23, 2003, 
entire content of which is expressly incorporated by 
reference herein. 

Industrial Applicability 

15 The present invention can code a spectrum at a low 

bit rate and with high quality and is suitable for use 
in a transmission apparatus or reception apparatus or 
the like. Further, applying the present invention to 
hierarchical coding enables a voice signal or audio signal 

20 to be coded at a low bit rate and with high quality, which 
is suitable for use in a mobile station apparatus, base 
station apparatus or the like in a mobile communication 
system . 
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